(N 



(N 



(N 



Optimal Trading with Linear Costs 

Joachim de Lataillade, Cyril Deremble, 

Marc Potters & Jean-Philippe Bouchaud 

Capital Fund Management, 6 Boulevard Haussmann, 75009 Paris, France 



O 

^ : Abstract 

H - 

C^ . We consider the problem of the optimal trading strategy in the presence of linear costs, and 

with a strict cap on the allowed position in the market. Using Bellman's backward recursion 

method, we show that the optimal strategy is to switch between the maximum allowed long 

position and the maximum allowed short position, whenever the predictor exceeds a threshold 

value, for which we establish an exact equation. This equation can be solved explicitely in the 

case of a discrete Ornstein-Uhlenbeck predictor. We discuss in detail the dependence of this 

threshold value on the transaction costs. Finally, we establish a strong connection between our 

n , ' problem and the case of a quadratic risk penalty, where our threshold becomes the size of the 

r . optimal non-trading band. 

Q-i' 1 Introduction 

Contrarily to the efficient market dogma, prices have some degree of predictability, at least on short 

^ . time scales. Statistical arbitrage strategies aim at exploiting this small predictability. However, 

1^ I costs make it difficult, although not impossible, to eke out a profit from these inefficiencies. Indeed, 

Q>^ ' every trade is costly; the structure of these costs is actually quite complex. Some of them are 

VO ■ related to various fees (market fees, brokerage fees, etc.) and are usually a small fraction of the 

ff^ I traded quantity (typically 10~^ to 10~^ on liquid markets). We call these costs "linear", because 

^^ [ they are simply proportionnal to the traded amount. Another source of linear costs is the bid- ask 

spread, which represents a few "basis points" (bp, i.e. 10~^ of the value of the contract). Much 

more subtle are impact induced costs, which come from the fact that a large order must be split 

in a sequence of small trades that are executed incrementally. But since each executed trade, on 

r> ' average, impacts the price in the direction of the trade, the average execution price is higher (in the 

C^ . case of a buy) than the decision price, leading to what is called "execution shortfall". This cost is 

clearly non-linear, since the price impact itself increases with the size Q of the trade. Empirical data 

suggests that the impact induced cost is on the order of aQ^''^/VV, where a is the daily volatility 

and V the daily turnover (see e.g. |TLD+ll| ). This shows that for an order corresponding to 1% 

of the daily turnover, and for an asset with 2% daily volatility, the impact cost is on the order of 

20 bp. But for much smaller orders, or for assets where the bid-ask spread is large, linear costs can 

be dominant. 

The problem we want to address and solve in this work is to determine the optimal trading 
strategy when one has discovered a predictive statistical signal, in the presence of linear trading 
costs and a constraint on the maximum size of the position (both long and short). While the case of 
a quadratic risk control has been considered in the literature for linear costs |DN90| ISS94| ICon86) , 



quadratic costs (i.e. costs growing like Q^) |DL07|IAC00| or various impact-dependent cost |Alm03| . 
we are not aware of any published solution in the case where the risk constraint is a cap on the 
position. This problem can be of significant interest in practice, because the risk of a trading system 
is sometimes handled completely outside of the system, through such a cap on the position, in order 
to reduce operational risk. 

Our final solution for the optimal position vr as a function of the predictor p is as follows: the 
size of the position |7r| should always be at the maximum allowed position M, with a sign that 
switches between —1 and +1 whenever the predictor exceeds a threshold value q* (and vice- versa 
when the predictor becomes smaller than —q*)- We find an exact equation for the threshold q*, that 
we solve explicitely in the case of a (discrete-time) Ornstein-Uhlenbeck predictor. In some limits 
we find that the threshold q* scales as the one-third power of the cost parameter, a result already 
discussed in the literature but in the context of a quadratic risk contraint. We explain why our 
result is strongly connected with this alternative problem. We also check the validity of our results 
using numerical simulations. 

2 Description of the problem 

We consider an agent who trades a single asset, of current price Pricej. The position (signed number 
of shares/contracts) of the trader at time t is ttj. We assume that the agent has some signal pt that 
predicts the next price change rj = Pricet+i — Pricey, and is faced with the following constraints: 

• His/her risk control system is simply a cap on the absolute size of his position : |7r| < M, 
with no other risk control. M will be called the "MaxPos" of the agent. 

• He/she has to pay linear costs TQ whenever he/she trades a quantity Q = \6Tr\ = {irt+i — vrj| 

The agent wants to maximise his/her expected gains, by trading over a long period [0, T] (we 
will later consider the limit T — )• oo). 
We also assume the predictor to : 

(i) have a linear predictability: £'[rt|pt] = A ■ pt with some constant A 
(ii) be positively auto-correlated: Wq, P{pt-^i > q\pt) increases continuously with pt 
(iii) be Markovian: Va;t_|_i, P{u}t+i\pt,Pt-i, ■ ■ ■) = Pi^t+i\pt) where oJt+i is any event at t -|- 1 
(iv) be symmetric: Vg, P{pt+i > q\pt = p) = P{pt+i < -q\pt = -p) 
(v) be unbounded: Vg, 3eq > s.t. P{pt+i > q\pt = 0) > e^. 
Remarks: 

• Without loss of generality, we can always set A = 1, so that pt is in price units. 

• Because of the positive auto-correlation of pt , we can define an integrated predictability 
at t = oo, depending on pt : 

oo 

PooiPt) = E[ Priccoo - Pricct | pj ] = ^ E[ pt+n \ Pt ] 

n=0 

This quantity indicates how much one will gain in the future if one keeps a fixed position 
TTf'^t = IT : the expected gain is then Poo{pt) tt. 



• Finally, one can consider the case where Condition (jl|) is not met, and note 

^t{pt) = E[rt\pt] 

the immediate predictability. Then, as we will see in Section I3.6| we can still express the 
solution of the system if we suppose that £j is continuous, uneven, strictly increasing, and 
limpj_j.oo -2t(pt) > r. But the parameters of this solution will of course depend on these 
functions £(. 

3 The Optimal Strategy 

3.1 A naive solution 

At first sight, the solution to this problem seems straightforward: if the expected future gain (given 
by the integrated predictability) exceeds the trading cost per contract F, then one trades in the 
direction of the signal (if not already at the MaxPos), otherwise one does not. This solution obviously 
generates a positive average gain, but it has no reason to be the optimal solution. Indeed, because 
the predictor is auto-correlated in time, it might be worthy (and in general it will be) to wait for 
a larger value of the predictor, in order to grab the opportunities that have the most chances to 
get realised, and discard the others. As we shall see, the mistake in this naive reasoning is not to 
compare the future gain with the cost, but rather comes from a wrong definition of the future gain, 
which does not include future trading decisions. 

3.2 The Bellman method 

To attack this problem, we will use Bellman's optimal control theory, or dynamic program- 
ming |Bel03| . which consists in solving the problem backwards: by assuming one follows the op- 
timal strategy for all future times t' > i, we can find the optimal solution at time t. As is usual 
in dynamic programming, we have a control variablqj vrj, which needs to be optimised, and a 
state variable p^, which parametrises the solution. The optimisation will be done through a value 
function Vi(7r,p), which gives the maximal expected gains between time t and -|-oo, considering 
that the position at i — 1 is vr and the predictor's value at t is p. The optimal solution of the system 
will be denoted {T^t)t&\o,T]- 

Let us start with the simple case t = T, i.e. the optimal strategy at the last time step. In this 
case the expected future return is really Poo{p)'^T where p = pT, since no trading is allowed beyond 
that time. Any trade Sn induces a cost r|j7r|, so: 

• if Poo(p) > r then TT^ = M, VT{7r,p) = Poo{p) ■ M - T{M - tt). 

• iipooip) < -r then vr^ = -M, VTiTr,p) = -poo{p) ■ M - T{M + tt). 

• if \Poo{p)\ < r then vr^ = vr, Vt{it,p) = Poo{p) ■ vr. 



^In the usual Bellman terminology the control variable is actually -Kt — 7i"t-i, but here we want to insist on what 
we can control (the position) and what we cannot (the value of the predictor). 



Hence, we recover exactly the naive solution in this case, but this is only because there is no trading 
beyond t = T. 

Now if we consider t < T, we have to maximise a quantity including immediate gains, costs and 
future gains. This leads to the following recurrence relation: 

Vt{TT,p) = max (p-tt' - r|7r' - tt\ + P{pt+i =p' \pt=p) Vt+iiir' ,p') dp' ) (1) 

W\<M J 

and VTj is the value of tt' which realises this maximimum when vr = vr^.i and p = Pt- 

3.3 General solution 

In what follows, we will need the following notations: 

P{p'\p) = P{pt+i =P \pt=p) 
'P>q{p) = P{Pt+i > q\pt = p) 
'P<q{p) = P{Pt+i <q\ Pt=p) 

where the dependency on t is kept implicit. 

Proposition 1. There exist two functions g{t,p) and h{t,p) and a sequence {qt)te\o,T] such that, 
for every t G [0, T], we have the following: 

vr;_i if\pt\ < Qt 

M if Pt > Qt (with 7r_i = 0) 

^-M ifpt<-qt 

{g{t,p)TT + h{t,p) if\pt\ < qt 

{g{t,p)-r)-M + rTT + h{t,p) ifpt>qt 
{-g{t,p) -T)-M-Vtt + h{t,p) ifpt < -qt 

g{t,p) is a continuous, strictly increasing function of p which satisfies, for t < T: 

/<it+i 
P{p'\p)g{t + l,p')dp' (2) 

-<?t+i 

qt is such that qt > and 

9it,qt)=T. (3) 



Proof. The proof is done by backwards recursion {ie. we assume that it is true for t+ 1 to prove 
that it is true for t). One can easily check from section (|3.2p that the statement is true for t = T, 
with in particular g{T,p) = Poo{p)- 



Let us then suppose that it is true for t + 1. We have the following: 

Vt{7r,p) = max { p ■ n' - T\tt - vr'l + [ P{p'\p) Vt+iiir' ,p') dp' ) 

k'|<A/ J 

/qt+i 
P{p\p) ■ W + 1,pW + h{t + l,p')]dp 
-qt+l 

r+oo 

+ / P{p'\p) ■ [{g{t + l,p')-r)-M + Ttt' + h{t + l,p')]dp' 

r-qt+i 
+ / P{p'\p) ■ [{-g{t + l,p') - r) • M - Ttt' + h{t + l,p')W 

J —oo 



So if we set 



g{t,p)=p + V-[V>,,^M-'P<~<it+.ip)]+ / P{p'\p)g{t + l,p')dp' 



and 



r+oo r~gt+i 

h{t,p) =n + M P{p'\p) . [g{t + l,p') -T]dp' + M P{p'\p) ■ [-g{t + l,p') - T] dp' 

J qt+i J ~oo 

with 

/ + 00 
P{p'\p)h{t + l,p')dp', 
-oo 

this gives us: 

Vt{Tr,p) = max { g{t,p) ■ ir' -T\7r - Tr'\ + h{t,p) ) 

\n'\<AI 

= max [ max {g{t,p) — T)Tr' + Ttt , max ig{t,p) + r)7r' — Ttt ] + h{t,p) 

7r<7r'<A/ —M<n'<n 

Using the definition of g{t,p) above, we can prove the following : 

• g{t, 0) = by symmetry 

• g{t,p) is continuous and strictly increasing with p: indeed, if we rewrite Equation ([2]) as 

/oo 
P{p'\p)g{t + l,p')dp' 
-oo 

where 

'g{t + l,p') if|p'|<gi+i 

g(t+l,p')=<r iip'>qt+i 

^ -r if p' < qt+i 

then it suffices to see that p i— t- g(t + l,p) is continuous and increasing, as well as the cumulative 
distribution fuctions p i— )• P>q{p) for g G M 

• limp_j>_|_oo5(i,p) = +00 because for p > 0, we can show that 'P>(jt+i(p) > V<^-q^^^{p) and 

/_t:,np'iPMi+i,p')>o. 



Thus, there exists a unique qt > which satisfies 

9{t,qt) = r 

and we finahy have: 

• if P > (?*, then g{t,p) > T so: 

yi(7r,p) =max [ AM + rvr , (A + r)7r ] + h{t,p) 

with A = g{t,p) - r. But in order to have AAf + Ttt < (A + r)7r we need it > M (or A = 0), 
so the maximum is reahsed for tt' = M, and 

yt{^,p) = {9{t,v) - r) • M + r^ + h{t,p) 

Note that, technicahy, in the case p = qt, then A = so that any vr' > vr maximises Vi(7r,p): 
the optimum is not unique anymore. But for the sake of simpHcity, we impose the solution 
vr' = M in that particular case. 

• if p < —qt, then g{t,p) < —T, and similarly we obtain a maximum for vr' = —M, and 

Vti7T,p) = {-g{t,p) - D • M - rvr + h{t,p) 

• if IpI < qt, then \g{t,p)\ < T, the maximum is realised for vr' = vr and 

^t(vr,p) = g{t,p)TT + h{t,p) 

D 

3.4 The self-consistency equation 

The solution given by Proposition [1] exhibits a dependency in t. 

Let us now consider the case where T — )■ oo, and suppose that the predictor is stationnary, i.e.: 
P{pt+i = p'\pt = p) is independent oft. Then we obtain a telescopic solution, where the dependency 
in t completely disappears, so that we only have a one- variable function g and a threshold q* , 
satisfying the following equations: 



g{p) -- 


= P + r • [P>,. (p) - P<_,* (p)] + /' P(p» g(j^) dp' 




9(g*) = r 



(4) 
(5) 



Equation (j4]) is a self-consistent functional equation. The optimal solution to the system is 
then: 

• if Pt > q* then vtj* = M 



if Pt < -9* then vr^* 



-M 



• if \pt\ < q* then vr^* = 7rj*_j^. 

Thus, we obtain a very simple trading system, always saturated at ibMaxPos, with a threshold to 
decide at each step whether we should revert the position or not. This of course looks a lot like the 
naive solution from Section 13.11 The only difference lies in the value of the threshold q* , defined by 
Equations (|4]) and ([5]), instead of (/naiVe = P^(^) for the nai've solution. Intuitively, those equations 
take our future trading into account, whereas the nai've solution does not. 

If we look closely at Equation (J4]), its interpretation becomes transparent: g{p) is equal to 1/2M 
times the expected difference in total future profit between the situation where ir = +M and the 
situation where vr = — M. This difference is made up of: 

• the term 2pM which represents the difference in immediate gain 

• 2TM ■ V^q{p) which represents the loss if the current position is —M and in the next time 
step the predictor goes over the positive threshold q* (hence vr will go to +M) 

• 2TM ■ V<:^q{p) which represents the loss if the current position is +M and in the next time 
step the predictor goes below the negative threshold —q* (hence vr will go to — M) 

• J^ P{p'\p) [2Mg'(p')] dp' which is the expected difference in total future profit if, in the next 
step, the predictor remains between the two thresholds (leaving vr unchanged). 

Since the change of position between — M and +M costs 2TM, it makes sense to compare 2Mg{p) 
with it and only trade when g{p) is greater than T. Hence, g{p) can be seen as the "gain per traded 
lot". 

Equation (j4]) already allows us to say that g(p) > p for any p > (and g{p) < p for p < 0). 
As g{q*) = r, this implies in particular that q* < T. This property is actually rather intuitive: 
indeed, if the immediate gain is higher than the trading cost, then there is no reason not to trade 
the maximal possible amount. 

3.5 Reformulation as a path integral 

Although Equation <^ is easy to interpret, it proves very difficult to solve in concrete cases like 
the Ornstein-Uhlenbeck case that we will consider in Section [H In the present section, we will 
extract an alternative equation for the optimal threshold, which, although more sophisticated than 
the self-consistency equation, will be easier to solve in practice. 

Equation (j3]) can be rewritten by expanding the function g: 



g{p) = p 



p'P{p'\p)dp' 



q* 






+ T- 



r- 



p{p'\pW + 

q' Jq* 



p"P{p"\p')P{p'\p)dp'dp" + ... 
q* 

00 fq* 

P{p"\p')P{p'\p)dp'dp" + ... 



-q r-q 

P{p'\p)dp' + / 

00 J —00 



P{p"\p')P{p'\p)dp'dp" + 



Let us now consider that the predictor starts at po = p at time t, and fohows the infinite path 
{pi,P2,P3, • • • ) afterwards. The above expansion tells us that this path will contribute to g{p) as 
long as —q* < pi < q*, and will stop contributing as soon as \pi\ > q* for some i > 0. Moreover, its 
contribution is given by the sum Y17=o Pi^ where n > is the first index such that |p,„| > q*, and by 
itr, depending on whether p„ > q* or pn < —q*- We will only consider predictors for which such 
an n exists, which is true with probability 1 thanks to Condition (jvj) in Section [2) indeed, if Pn is 
the probability for a path of length N, starting at po = p, to satisfy \pi\ < q for any 1 < i < N, 
then P/v ^ (1 — ^q) with < e^ < 1, so that limjv_^_|_oo Pi 

If we now set P{po, ...,pn\p) = P{pt+i 
equation: 



N 



0. 



Pi,i G [0,n] I pt = p), this leads to the following 



9{p) 



n=0 



fn-\ 



1 



^ — ' /'+00 ('q rq / '^ j^ \ "■ 

„_n L Ja' J-0* J-0* \i^Q J i^Q 

/n— 1 \ n 

X] Pi~^] ^(^'0' • • • 'P«ip) n ^P"- 



q' J -q' 
9* f-q* 



-q* J -q- 

Using now the fact that ^(g*) = T, we get: 

~" -oo cq* r-q* 



n=0 



+ 



q' J-q- 
-q* j-q* 

oo 



vi=0 



E Pi + r p(po, 

'n~\ \ 

E p, - r p(po, 



i=0 



>Pnk*) ^^n 



i=0 



,Pn|g 



i=0 



As we said above, there always exists, with probability 1, an integer n such that |p„| > q^ so: 



i = E 



n=0 



p(po,---,pnk*) n^^* 



i=0 



which leads to: 

oo 

E 



n=0 



+ 



fn-~\ 



'1* \i=0 



E Pi - 2r • i|p„<_g,} P(po, • • • ,Pn\q*) JJdpi = 



i=0 



where 1 is the indicator function. 

This can be perhaps more gracefully expressed as a path integral: for a finite path (f) : [0, n] — t- R, 
we note T^ = n, (j)t = <^(0), (pe = (p{n), V{(p\p) = P{pt+z = 4>{z), z G [0, n] | pj = p) and 
J (j){z)(iz = X]"=o '?^(*)- The equation above can then be symbolically expressed as: 



(6) 



Figure [T] sums up this reformulation of the problem: the value of q* is such that the "penalty" 
2r over all paths exiting through —q* is equal to the average gain (given by the sum of the values 
of the predictor) over all paths exiting either through q* or —q*. 



\4:e\>q* 










/ 


/ ^{z) dz - 


-2r-i{<^.<_,*}(<A) 


P{4>\q*) V<p = 


= 


(t>b=q* 

-q* <4^{z)<q* , ze]0,T4 












-2r 



Figure 1: Path integral representation 

There is actually a direct interpretation for Equation ([6]), based on Figure [H which is worth 
understanding. Imagine that we start with pt = p at time t, and the position before our trading 
decision is nt-i = — M. Knowing how we will trade in the future (which depends on the optimal 
threshold q*), we wonder whether it is worth reverting the position right now by buying Avr = 2M. 
Note that the reasoning below does not depend on the actual value of Avr, so one could as well 
consider buying just one share/contract. 

Let us suppose that we do trade this quantity Avr, with a cost of F • Avr. The next time we 
can possibly trade in the future is the first time T > t such that \pt\ > Q*- If Pt ^ Q* (path (f>i 
on Figure [1]), then we would have reverted our position anyway, so the cost of doing it early can 
be considered as null, and our gain is given Avr times the values taken by the predictor, hence 
Avr J (f>{z)dz. If on the contrary we have pT < —q* (path (j)2 on Figure [1]), then we will revert again 
our position to —M by paying F • Avr, hence a total cost of 2F • Avr, whereas the gain will also be 
given by Avr J (f)(z)dz. 

In the end, it is worth reverting the position to vr^ = M if, and only if: 



Avr 



\4>e\><]* 

P{<t)\q*) V(f> > 



I ^(/>(z)dz-2F-l|^^<_,*}(</.) 



<Pb=p 

-q* <,f>(z)<q* , z£]Q,T^[ 



As the optimal threshold determines exactly the limit between the trading zone and the no-trading 
zone, we recover Equation ([6]). 

Hence, the rather simple problem we introduced in the present article has a very non-trivial so- 
lution, which is best described through the above path-integral formulation. Note that Equation ([6]) 
is completely general provided the assumptions of Section [2] are satisfied, it does not rely on any 
specific statistics of the predictor. In the next section, we will explicitely solve this equation when 
the predictor is Gaussian and follows a discrete Ornstein-Uhlenbeck evolution. 



As a matter of notation, we will set: 

\<t>e\>q 

C{p) = 



I I 



(z) dz 



P{<P\p) VcP 



<Pb=P 
-q* «t>{z)<q* , ze]Q,T^[ 

e<—q* 



rip) 



I 



Pirn Vcl>, 



<pb=p 

-q*<4>{z)<q*, z&\0,T^[ 



which can be interpreted, respectively, as the average contribution of all paths before exiting the 
channel [—q*,q*]-, and as the probability for hitting the lower boundary —q* before the upper one 
q* . In terms of these quantities, Equation ([6]) now writes: 



C{q*) =2T-V{q*) 



(7) 



In some cases, both sides of this Equation will tend to be infinitesimal, so it is rather the ratio 
limp_>g. C{p) I linip^g. Tij)) that we will ask to take the value 2r. 

3.6 A note on non-linear predictability 

By looking at the proof of Proposition [H we note that the linearity of the predictability is not a 
crucial hypothesis. What we actually need is that 2,t{pt) = E[rt\pt] satisfies the following properties, 
for any t: 

• £,t is continous and strictly increasing 



lim 



Pt—>oo 



^tiPt) > r. 



With these hypotheses it is possible to prove once again Proposition [H except that Equation ([2]) 
becomes: 

rqt+i 
g{t,p) = Utip) + r • [V>,,^,{p) - P<_g,+i(p)] + / P{p\p)9{t + l,p')dp' 

■J -qt+i 

and the expression for Vt{TT,p) is similarly impacted. 

If we want to consider the telescopic solution, then we need to have a predictability independent 
of t, that is: -Cj(p) = -2(p). This gives, for the self-consistency equation: 

g{p) = £{p) + r • [r>g{p) - r<-q{p)] + f p{p'\p) g{p') dp' 

J-q 
9{q) = r 

This can again be solved using a path-integral formulation: 

|</'el>g* 



J ^£((^(z))dz-2r.l{^^<„,. 



P{(t>\q*) V(P = 



<Pb=q 

-q* <(l){z)<q* , 2:G]0,T^[ 
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4 Application to an Ornstein-Uhlenbeck predictor 

4.1 Definition 

We will now focus on the case of a predictor following a discrete Ornstein-Uhlenbeck dynamics: 

Pt+i - Pt = -e-pt + /3 ■ S,t (8) 

where (^t)tgR is a set of independent M{0, 1) Gaussian random variables. 

One classical example of such a predictor is an exponential moving average of price returns: 

t'<t 

If we suppose, as is usual, that the returns rj are J\f{0,ar) random variables, then this gives an 
Ornstein-Uhlenbeck predictor with e = 1 — p and /3 = Kdr- Note however that the rj must have 
some small correlations in order to be predictable! Therefore, in this case, the discussion in terms 
of an Ornstein-Uhlenbeck process is only consistent in the limit of small predictability, i.e. K <^ 1. 

4.2 Properties & Orders of magnitude 

Let us consider Equation ([8]) with the hypothesis that e <C 1. Then we have pt+i ~ e~^pt + /3 • ^t, 
so that 

E[ptJ^n\pt] w e-'>f 
So, r = e~^ is the auto-correlation time of the predictor pt. The standard deviation of the 
predictor, i.e. its average predictability, ap = -^/Elpf], is given by /9/\/2e (in the limit e ^ 1). 
Hence: 

• the smaller e is, the longer the predictor takes to express itself 

• the higher /3 is, the better the signal is (on average). 
The integrated predictability is given by 

oo oo 

Poc{p) = Yl ^[Pt+n\Pt] ~ Yl ^~"'^* 
n=0 n=0 

psp/e. 

This implies that the nai've threshold value is given by QnaiVe = Te, while the integrated average 
predictability is: 



/2e3 

In practice, if a real price predictor is to be both useful and realistic, it should beat the trading 
costs when the predictor value is a few times its standard deviations. This allows to obtain a 
system which trades regularly, but not too often, compared to its auto-correlation time. For our 
Ornstein-Uhlenbeck predictor, this implies that when pt oc /3/v2e, one should also have pte~^ oc F. 
Therefore, the interesting regime for practical applications is: 

/? oc Te^/"^. 

In what follows, we will study the problem by distinguishing between two cases: 

11 



• If /3 ^> r, the predictor can easily beat its transaction costs at every step. This situation 
(which is not very reahstic) requires us to keep a discrete time approach of the problem. 

• If /3 <C r, the predictor needs in general a large number of steps to beat the costs. This will 
lead us to a continuous formulation (and resolution) of the problem. 

4.3 Discrete case: /3 > F 

We already explained in Section [3.41 that we always have q* <T. Consequently, whenever /3 ^> F, 
we also have /3 ^ q* . This means that, starting at p = q*, one will typically jump beyond q* or 
—q* in just one step. Thus: 



Ciq*) = q* 



V{q*) 



+00 -x'^/2 



-dx with 



(2 - e)q* 



Since /3 3> q*, one has x* ^ 1, and thus V{q*) ~ 1/2. Equation ([7]) finally gives: 

o* =F 



(9) 



Hence, if the volatility of each predictor change is very large compared to the trading costs, then 
one needs to be as selective as possible. 

4.4 Continuous case: (3 <^V 

First, let us show why the condition j3 <^T leads us to express the problem in a continuous form. 
Under that condition, we cannot have an optimal threshold q* of the same order as magnitude as 
/3 itself. Indeed, if this was the case, any time the predictor has the value g*, it would have a 
significant probability to go below —q* at the next step since the predictor changes by an amount 
oc /3 at each time step. The optimal strategy would then require to resell everything at cost 2F, 
whereas the immediate gain would only be of the order of magnitude of /3. Therefore, Equation ([7]) 
could not be satisfied. 

Now, knowing that q* ^ /3, we need to evaluate V{q*) and C{q*). But for the predictor to go 
from (/* ^ /3 to —q* <^ —(3 requires many steps. Therefore, one is effectively is the continuum limit, 
where the variation of the predictor at each time step is infinitesimal compared to q* . We can then 
approximate the dynamics of the predictor by a drift-diffusion process: 



dp = -ep dt + /3 dXt 



(10) 



where {Xt)t is a Wiener process. 

In such a continuous setting, the quantities C{q*) and V{q*) are actually ill-defined because the 
diffusion process starts on an absorbing boundary. This is a classical problem, which is handled by 
starting infinitesimally close to q*. Therefore we consider C{p) and V{p) for p < q*. It is easy to 
show that these two functions obey two Kolmogorov backward equations, that read: 



1 ^ 
2'^ V 



dC 

^P IT 
op 



-p 



2^ dp^ 



dv 

ep —- = 
op 



ill) 



with boundary conditions: C{±q*) = and V{q*) = 0, V{—q*) = 1. 
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4.4.1 Solution 

Solving Equations (|lip with their boundary conditions leads to: 



"^"'''li'-il 



"e^^'dv 



e^-^^dv 



with 

1= [ e^^^dv and a 
Jo 



/?2- 



Setting now p = q* — u with u — )• 0, one finds that Equation ([7]) becomes, to first order in u: 



u uq* e"-'^* e°-'^* 

-- + — — ^ Tu 



eel I 

As expected, u disappears from the equation, to give the following solution for the threshold q*: 



— -— where F{x) = x — e I e 



q* = A= F-^ ( ^^ \ where F{x) = x- e"^" / " e""^ dv 



(12) 



Note that when e ^ 1, this equation can be expressed entirely in terms of the integrated predictabil- 
ity: 

p^{q*)=T-H{^^\ where H{x) = x F'^ (^ . 

This means that we can find the optimal threshold for a predictor by studying only its total predictive 
power (if we suppose of course that it satisfies all the required properties). 

4.4.2 Limits 

One can now study the limits of Equation (J12p for large and small values of the only remaining 
adimensional parameter t] = re^"//3. Interestingly, 7/ ~ 1 is the regime mentioned above where 
predictability beats costs whenever the predictor's value is of the order of its rms. 
The limiting behaviours of the function F{x) are as follows: 

• if X » 1, then J^ e" dv <^ e^ , so F{x) w x 



if X ^ 1, then F{x) ~ x — {I — x^) j^{l — t!2)dt 



2x^ 
3 ■ 



Therefore when rj ^ 1, the threshold is simply given by q* = Te. This result is rather intuitive: 
if /3 is very small then the predictability of the predictor is weak, compared to the trading cost. 
Hence, it makes sense to try to catch any profitable opportunity, without taking future trading into 
account. That is why we recover the naive solution of Section (13. ip . 



If on the other hand /3 ^ Te^" then r/ ^ 1, and F ^(?/) ~ \/^ ' V^ which finally gives 



"l-rp^ (13) 
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This is the resuh that would obtain with a predictor foUowing a Brownian motion. Indeed, if 
/3 is large enough, the mean-reverting effect e is not relevant, and the optimal threshold must 
consequently be independent of e. 

4.5 Shape of the global solution 




Figure 2: Optimal thershold as a function of /3 

We summarize the various regimes in Figure [21 where we plot the optimal threshold q* as a 
function of /3, for e and F fixed. One can see the three main caracteristics of this solution: 

• a constant threshold q* = Fe for small values of /? ^ Fe^' ^ 

• a sublinear behaviour q* oc F^' ^/3^' ^ in the intermediate regime between /3 ~ Fe^' ^ and /3 ~ F 

• a constant threshold g'* = F for large values of /3 ^ F. 

Note that these different results match seamlessly at the boundaries between the regimes. Indeed, 



q* = f/l • F/3^ becomes of the order of Fe when /? ~ Fe"^'^, and becomes of of F when /? ~ F is of 
the order of F. One can also compute systematic corrections to q* = T as an expansion in F//3: 



(1-6) 



TT 13 



/3>F. 



4.6 The case of a white noise predictor 



To conclude, let us consider the special case of Equation ^ where e = 1. In that case, the predictor 
is a white noise in time: E[ PtPt+i ] = 0. Since we assume a perfect p i— )• —p symmetry, the self- 
consistency equation becomes simply g{p) = p, which trivially implies that q* =T in that case. This 
is also consistent with our explicit solutions above: when e = 1, the intermediate regime disappears 
and one indeed finds q* = Te = F. This threshold is what we expect from such a system: without 
any auto-correlation, the best strategy is to trade as soon as the instantaneous predictability is 
above the trading cost. Note that in this case poo = Pt, so this threshold also coincides with the 
nai've solution. 
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5 Numerical results 

To check the robustness of our analytical results, we ran some simulations to determine the optimal 
threshold numerically, to be compared with the theoretical value we obtained in Section U] 

In what follows we set e = 0.001. We can set F = 1 without loss of generality, and the only 
remaining variable is (3. 

The algorithm to find the optimal threshold for a given value of (3 runs as follows: 

1) We choose a set of threshold values qi,...,qn uniformely distributed over a reasonably large 
range (which contains the theoretical optimal threshold). 

2) We generate a long random path {pt)te[o,T] for the predictor, following the law given by Equa- 
tion dl]). 

3) For each value of the threshold, we simulate the behaviour of the corresponding strategy, with a 
MaxPos of 1. 

4) To obtain the P&L of each system, we calculate at each time step t the gain given b}o Pt " T^t 
(where VTf is the position of the system) and the cost given by r|(57rt| if there is a trade JvTi. 

5) We select the threshold qj with the maximal total P&L. 

6) We choose new values q[,...,q'^ for the threshold, distributed around qj, and we restart the 
algorithm with these values. 

This loop is repeated several times, in order to get a sufficiently precise approximation of the optimal 
threshold. The result of this process is shown in Figures |3] and HI The analytical solution in the 
continuous case, given by Equation (I12p . is easy to compute (the function D(x) = e~^ L e" df is 
a classic, called the Dawson function). 

By comparing the two curves, one can check that the analytical solution is indeed a good fit of 
the simulation results. However, there are two interesting details to note: 

• when the value of /3 becomes very small (Figure [3]), the solution of the simulation becomes 
very noisy; 

• there is a discrepancy between the analytical and simulated solutions when /3 increases (Fig- 
ure |4]) and, rather surprisingly, this happens in a regime where the inequality /3 <C F still 
seems to hold. 

The reason for the first effect is obvious: when /3 becomes very small, then the predictor hardly 
ever beats its optimal threshold Fe during the course of the simulation, which implies that the real 
optimal threshold becomes very difficult to find: we need to use a huge number of steps in the 
simulation to find the proper solution. 

The answer to the second remark is more subtle: if we go back to Section 14.41 we see that it is 
not really the hypothesis /3 ^ F which implies that we are in a continuous setting, but rather the 
inequality P <^ q*. Of course this second inequality was derived from the first one, but we did not 



■^Indeed, as we only consider expected values, there is no need to generate a random variable for the price return 
rt = Pricet+i — Pricet as a function of the predictor pt : one can directly consider the mean of this variable, which is 
exactly pt- 
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Figure 3: Optimal threshold for < /3 < 1.2 • 10" 
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Figure 4: Optimal threshold for < /3 < 0.003 



specify at that time the orders of magnitude involved in each inequality. In fact, if we consider for 

1.15. This means that 



with K 



the threshold g* the limit value a'/| • r/3^, one sees that \ = '^ \i ^ 
our so-called continuum hypothesis breaks down much sooner than a nai've comparison between /3 
and r would tell. In Figure [U if we look at the point for which /3/r ~ 0.002, one actually has 
(7*/r ~ 0.144; so the continuum hypothesis is already unwarranted for this value, and the solutions 
do not strictly coincide, even though the ratio between /3 and F is still extremely small. 



6 Extension to other risk constraints 

6.1 The "band" system 

The problem we have presented and solved in the previous sections was rather specific, as it required 
a risk control based on the only constraint |7r| < M. An alternative, more classical way to handle 
risk, is to consider a quadratic penalty R{irt) = vr^ that represents risk aversion. The "utility" to be 
maximized is at each step t given by gt = rfiTt — Avr^. Note that the penalty term could read, more 
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generally, R{'iTt) oc \Trt/M\^. The case z = 2 is the above quadratic penalty, whereas the constraint 
on the maximum position formally corresponds to the limit z — t- oo. 

The quadratic problem was considered in |MS1H IMarl2| , where it was showed that the optimal 
strategy in this setting is a band, also called a DT-NT-DT (Direct Trading - No Trading - Direct 
Trading) system. This system is defined as follows: 

• there is a one-to-one correspondence between the values of the predictor and the position (in 
the quadratic case, this is simply given by: vr = p/2X) 

• at each time step, one defines a band of size 2q* around the value of the predictor 

• if the image of the current position is inside the band then there is no trade; but if it is outside 
of the band then the trade should bring the image of the position to closest border of the band. 

Note that we did not explicit here the position of the predictor pt inside the band. It is actually 
proven in |MS11| that it does not have to be at the middle of the band, but it does when the trading 
cost becomes small. The band will be called symmetric in this case. This symmetric band is a 
property that will be necessary for our argument below to work. 

This band system looks at first sight rather remote from the system we studied in the previous 
sections. Still it was shown in |MS11| that if the asset price follows a mean-reverting dynamics 
(which corresponds exactly to the case of a continuous Ornstein-Uhlenbeck predictor) then the 
optimal half-size of the band is given, with the notations of the present paper, by: 



■/^•r/32 



when the trading cost T is small. This is exactly the value of our threshold in the continuous setting, 
when costs are small! 

Our goal in the next section is to explain that this in no coincidence. This will allow us not only 
to recover the results of |MS11| . but also to extend them by giving the optimal solution under the 
condition that the band is symmetric. 

6.2 Optimal size of the band 

In what follows, we suppose that we already know that the optimal trading policy is a band system 
as described above, with the predictor being at the center of the band, and we will find the value of 
the half-band for this optimal system, under some condition on the current value of the predictor. 

Consider that we start with pt = p, and the position before the next trading decision is at the 
lower border of the band: ttj-i = (p — q*)/2X. Knowing that the future trading style is a band of 
optimal size 2q*, we wonder if it is worth buying an infinitesimal quantity 6tt at t. 

Let us suppose that we buy this quantity. The new risk penalty term will then be 

R{TTt) = X{iTt-i + Sirf 

= Ri'Kt-i) + (P - q*) ■ Stt + 0(571^) 

Now let us follow the scenarii given by Figure (5) by definition of the band system, we will not 
trade until the predictor's value becomes larger than p or smaller than p — 2q* . In the former case, 
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p + q 



p-q 



p-2q 




2r 



Figure 5: Paths in the band system. The band is the region between p — q and p + q, the paths (j)i 
and (j)2 are the predictors trajectories defining the two possible ways to get out of the band if we 
start at the lower border. 

then the trading cost is zero because we would have bought dir anyway, whereas in the latter case 
we will have to sell it back, which gives a total cost of 2T ■ 5tt. 

In both cases, if we note (p the path between t and the first T such that Pt ^ P or p^^ < p — 2q* , 
then the gain of trading 5tt is given by 



Qs-n 



{(t){z) ■ 6^ - 5R) dz 



where 5R = R{-Kt) - R{T^t-i) — (p - q*) ■ (^vr. 

In the end, we have a positive gain by trading 6tt if, and only if: 



\'l>e-p+q*\>q* 



p-2q'«t,(z)<p, 2:G]0,T^[ 



dz - 2r • 1 



{<t>e<p-2q"}\ 



P(0|p) P0 > 



We can then do the change of variable ip{z) = (j){z) — 'p+q*, but in order to have P{il)\q*) = P(</)|p) 
we need to be in a case when the drift of the predictor can be neglected, i.e. when e is small enough: 
q* < (5/y/e. This gives (as (Jvr > 0): 



\i>e\>q* 



j y"v'(z)dz-2r-i{^^<„g*}(v^) 



P{i)\q*) Vtp > 



■>Pb=q* 

-q*<i>iz)<q*, z€]0,T^:[ 



The limit of the no-trading zone is the point where the cost of trading equilibrates precisely the 
gain, so the above inequality becomes an equality, and we recover exactly Equation ([6])! This means 
that the same equation defines the optimal threshold in the quadratic risk-control case with small 
trading costs and in the "MaxPos" setting. 
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6.3 Consequences 



As a consequence of our above result, we recover the optimal half-band size q* = yf • T/S^ for small 
values of F, and in particular the two-third dependency on the costs explained in |Rog04| . This 
also gives us the optimal value for any F, provided that the band is symmetric and q* <C P/ \/e: 
this value will be given by Equation [121 This can be seen as a "variational solution", where we get 
the optimal solution in a close-to-optimal subspace of the space of possible trading systems. But 
in reality, as shown in |MS11| . there is a small shift of the center of the band, which is however of 
higher order in T^'^ . 

Moreover, our result is more general than the case of an Ornstein-Uhlenbeck predictor: Equa- 
tion ([6]) works for any price predictor, whatever its dynamics, provided it satisfies the hypotheses 
of Section [2j And the technique based on Kolmogorov backward equations, that we presented in 
Section [4.41 can easily be extended to these various dynamics. 

Finally, it can be proven that for any reasonable risk- function R{'k) - for example i?(vr) = |7r|^ for 
any z > 0, if the trading system is a symmetric band then its half-size is again given by Equation ([6]). 
The limit z — ?• oo is singular in the sense that one loses the one-to-one correspondance between the 
predictor and the size of the trade, but formally our result holds for arbitrary z, and therefore 
applies to the "MaxPos" system considered in the previous sections. 

7 Conclusion 

We have considered and solved exactly the problem of the optimal trading strategy when one wants 
to follow a completely general Markovian predictor of the future returns of a single asset, in the 
presence of linear costs, and with a strict cap on the allowed position in the market. Using Bellman's 
backward recursion method, we have shown that the optimal strategy is to switch between the 
maximum allowed long position and the maximum allowed short position, whenever the predictor 
exceeds a threshold value, for which we establish an exact, non-trivial, equation. This equation can 
be solved explicitely in the case of a discrete Ornstein-Uhlenbeck predictor. We discussed in detail 
the dependence of this threshold value on the transaction costs. 

We also showed an unexpected relation between our problem and the problem where risk is 
handled dynamically, with an arbitrary risk penalty. The connection relies on the presence of a no- 
trading zone for the two problems, which allows a use of powerful Bellman techniques to calculate 
the optimal parameter. 

There are various interesting extensions of our results that one can think of. One could consider 
the case of a predictor with jumps, and see how this affects the threshold value. One could consider 
an asymmetric risk constraint, where the maximum long and short positions are different (this 
could be relevant for option trading). But the most relevant extension would be to consider the 
case, important in practice, where the costs have both a linear component (coming from fees, bid- 
ask spread, etc.) and a quadratic component that would model impact. One may hope that an 
exact solution is still available in some regime, at least in the case of a single asset, or when the risk 
constraints do not couple different assets. 
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